Stress-Testing General Purpose Digital Library Software
نویسندگان
چکیده
DSpace, Fedora, and Greenstone are three widely used open source digital library systems. In this paper we report on scalability tests performed on these tools by ourselves and others. These range from repositories populated with synthetically produced data to real world deployment with content measured in millions of items. A case study is presented that details how one of the systems performed when used to produce fully-searchable newspaper collections containing in excess of 20 GB of raw text (2 billion words, with 60 million unique terms), 50 GB of metadata, and 570 GB of images.
منابع مشابه
General purpose medical digital library definition
1 The need of an approach for the definition of a platform-independent medical digital library, using only 2 open-source tools, will be described. To test the need and the success of such an approach, a library will 3 be created, which can later be used in a larger scale as a general purpose digital medical tool, when comes 4 the need to evaluate an image. 5 As a first test, the library will be...
متن کاملGeneral-Purpose Digital Library Content Laboratory Systems1
The last decade witnessed a proliferation of systems specially devised for aggregating and then operating over information objects – e.g., publications, experimental data, multimedia and compound objects – collected from possibly heterogeneous and autonomous data sources. Such systems, to which we refer as “Digital Library Content Laboratories”, are typically highly domain-specific and thus fea...
متن کاملDIGITAL LIBRARIES: THE SYSTEMS ANALYSIS PERSPECTIVE Cataloging for the masses
Purpose – The purpose of this paper is to explore methods for opening up web content to automated classification using metadata, potentially in the context of library groupware or portals. Design/methodology/approach – Examines various web sites and meta-searching tools which provides a new means of access for users, and allow users to better document and integrate their research findings. Find...
متن کاملDSPSR: Digital Signal Processing Software for Pulsar Astronomy
DSPSR is a high-performance, open-source, object-oriented, digital signal processing software library and application suite for use in radio pulsar astronomy. Written primarily in Cþþ, the library implements an extensive range of modular algorithms that can optionally exploit both multiple-core processors and general-purpose graphics processing units. After over a decade of research and develop...
متن کاملUsing Open Source Software for Digital Libraries: A Case Study of CUSAT
Purpose – The purpose of this paper is to describe the design and development of a digital library at Cochin University of Science and Technology (CUSAT), India, using DSpace open source software. The study covers the structure, contents and usage of CUSAT digital library. Design/methodology/approach – This paper examines the possibilities of applying open source in libraries. An evaluative app...
متن کامل